Optimality of feedback control strategies for qubit purification 
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Recently two papers [K. Jacobs, Phys. Rev. A 67, 030301(R) (2003); H. M. Wiseman and J. 
F. Ralph, New J. Physics 8, 90 (2006)] have derived control strategies for rapid purification of 
qubits, optimized with respect to various goals. In the former paper the proof of optimality was 
not mathematically rigorous, while the latter gave only heuristic arguments for optimality. In this 
paper we provide rigorous proofs of optimality in all cases, by applying simple concepts from optimal 
control theory, including Bellman equations and verification theorems. 
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I. INTRODUCTION 



In the absence of decoherence, monitoring (that is, 
continuous- in-time weak measurement) of a qubit observ- 
able such as o z will eventually purify the qubit. However, 
the process of purification for finite times can be affected 
by applying Hamiltonian controls. Based upon results 
for discrete (but non-projective) measurements, Jacobs 
P, Q derived a strategy to maximize the average purity 
of the qubit at any given time. His strategy requires 
the controller to maintain the qubit state with (er 2 ) = 0. 
Wiseman and Ralph Q considered two different goals: 
first to maximize the fidelity of the qubit with any given 
state at any given time, and second, to minimize the av- 
erage time required to attain a given level of purity (or 
fidelity). For the first goal, they argued that Jacobs' 
strategy (with a trivial modification) was again optimal. 
For the second goal, they considered the strategy which 
maintains (a y ) = (cr z ) = 0, which is a no-control strategy 
provided that these conditions are met initially (and that 
any Hamiltonian not proportional to a z can be nullified). 
They showed that this outperforms Jacobs' control strat- 
egy, and argued that it was the optimal strategy. How- 
ever, none of these claims of optimality were proven using 
rigorous results from continuous time stochastic control 
theory. 

Control strategies derived from heuristic arguments 
may well fail to attain optimality. Some examples include 
continuous quantum error correction in Ref. and grad- 
ual discrimination of non-orthogonal states in Ref. [5| (in 
which Jacobs is well aware of the non-optimality) . How- 
ever, if a simple feedback strategy is conjectured to be 
optimal, then the truth of the conjecture can be demon- 
strated straightforwardly using standard techniques from 
optimal control theory — the so-called verification the- 
orems. The purpose of this paper is to highlight this 
fact, by showing that both of the above strategies (that 
of Jacobs, and that of Wiseman and Ralph), are indeed 
optimal for the different goals as claimed. We introduce 
the system in Sec. II, as well as reviewing some terminol- 
ogy and results from optimal control theory. In Sec. Ill 



we discuss and refine Jacob's problem, and prove the op- 
timality of his strategy. In Sec. IV we do likewise for 
Wiseman and Ralph's problem, and we conclude with a 
brief discussion in Sec. V. 



II. THE SYSTEM 

In continuous measurement theory a basic role is 
played by the quantum filtering equation 0, 0, H| , also 
known as a stochastic master equation 0, [lC| ■ This can 
be obtained from an underlying quantum Markov model 
describing the coupling of the system to an external en- 
vironment. The filtering equation is a stochastic differ- 
ential equation whose solution at time t is a (random) 
state matrix p t , such that the quantity TrLYp t ] corre- 
sponds to the conditional expectation of an arbitrary op- 
erator X with respect to the observations up to time t. 
In a more traditional quantum measurement perspective 
(see for example Ref. pt is known as the condi- 

tional system state, and represents the observer's beliefs 
about the system. The filtering equation also plays a 
fundamental role in quantum feedback control, because 
most control problems separate into a filtering step and a 
classical control problem for the filtering equation 0, || . 
That is, the optimal feedback control algorithms make 
use only of pt and have thus been called state-based [Tl| 
and Bayesian [l2| in the physics literature. In control 
theory such strategies are known as separated strategies. 

The goal of the particular control problems studied in 
[H S Q is to maximize the rate of purification of the 
conditional state pt in a suitable sense. We consider pre- 
cisely the same system as in [H, S monitoring one 
operator (a z ) for a qubit with the ability to control the 
Hamiltonian. In a solid-state context this monitoring 
could be realized by a quantum point contact [l3|, and 
in an optical context it could arise from an off-resonant 
coupling of the qubit to the electromagnetic field which 
is observed continuously in time via homodyne detection 
[f4| . Here we denote by a x , a y , a z the well-known Pauli 
matrices. We assume without loss of generality that the 
initial state po satisfies Tr[poCy] = 0. In that case we 
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only need a control Hamiltonian proportional to a y . 
quantum filtering equation for this model is 



The 



dp t = -ifl t [a y /2, p t ] dt + (p z p t o z - p t ) dt 

+ {°zPt + Pt°z - 2Tr[a,p t ]pt) dW t . {2.1) 

Here fit is the control input and Wt is the innovations 
Wiener process @, 0, 0] ■ It is related to the measurement 
record Y t on which p t is conditioned by 



dY t = 2Tr[p t a z ]dt + dW t 



(2.2) 



Rewriting Eq. (|2.1[) in terms of the Bloch vector com- 
ponents (x t ,yt,z t ) = Tr[(a x ,a y ,a z )p t ], we obtain 

dz t = 2(1 - z 2 ) dW t - fl t x t dt, (2.3) 
dx t = -2x t dt - 2z t x t dW t + Cl t H dt, (2.4) 

and yt = 0. We now introduce polar coordinates z t = 
Rt cos 9t and Xt = Rt sin 9 t . It turns out to be convenient 
in calculations to work with St = 1 — R 2 instead of Rt- 
S t is sometimes known as the linear entropy of the qubit. 
In these variables the filtering equation reads 

dS t = -45 t { [1 - (1 - S t )u 2 ] dt + y/1 - S t u t dW t ) , 

(2.5) 



d0 t = n t dt + f(et,St)dt + g(9t,St)dW t . 



(2.6) 



Here / and g are functions whose explicit form we will 
not need and we have written u t for cos 8 t . 

The equations (|2.5|) and (|2.6|) are already too compli- 
cated to be amenable to optimal control. In order to 
proceed, the papers [l], 0, Q assume that, rather than 
controlling Qt , one can directly control ut at every time 
to take any value in the interval [—1,1]. This could be 
seen as a singular limit of the above system, in the sense 
that an infinitely strong driving field fl t would ostensi- 
bly allow one to set the angle 9 t to any desired value. 
This type of assumption is commonly used to reduce 
open-loop optimal control problems in NMR to explic- 
itly computable form [ijj • In the present case, it is less 
clear that this is a reasonable approximation, as singular 
controls are incompatible with the assumptions used to 
derive the quantum filtering equation [f| Q. Our goal 
here, however, is to demonstrate optimality of the prob- 
lems ultimately considered in [l|, @, Q , so we will adopt 
their assumptions in the following. 

Before we proceed to the problems considered in [J, 
Q, we review some terminology. A control strategy of 
the form ut = u(t,St), where u(t,s) is a deterministic 
function, is called a Markov control. It is easily verified 
using the ltd rule that for any function f(t, s) that is C 1 
in t and C 2 in s, we have 



E s (f(t,S t )) = E s / 
Jo 

+ f(0,s), 



^, Sv ) + L^f{t',S v ) 



dt' 
(2.7) 



which is known as Dynkin's formula. Here ~E S (X) denotes 
~E(X\So = s), the expectation of X given the initial con- 
dition So = s. Moreover, for any v € [—1,1], we have 
defined the differential operator C v by 



where 

Vf(s) =8.s 2 (l-.s) 



-4s 



am 

ds 



v 2 Vf(s), 



d 2 f(s) 
ds 2 



+ 4s(l - s) 



ds 



(2.8) 



(2.9) 



III. JACOBS' PROBLEM 

Jacobs P, considered the following control problem: 
Fix an arbitrary terminal time T > 0, and find a feedback 
control u t (with values in [—1,1]) that maximizes the pu- 
rity of the qubit at time T. Specifically, Jacobs chose 
a goal of minimizing the cost function E S (SV), the ex- 
pected value of the random variable St, for So = s. This 
choice of cost function was made "to keep the calculations 
tractable" [l[ . While this choice gives a well-defined con- 
trol problem with a well-behaved solution (as discussed 
below), it was pointed out by Wiseman and Ralph Q 
that the cost function E(SV) lacks a clear physical mo- 
tivation. The reason is that it is a nonlinear function 
of the quantum state, and hence docs not correspond to 
the expected outcome of any physical process that can 
be performed on the qubit. 

Say the qubit is being purified for some particular 
purpose for which ideally the qubit would be in a par- 
ticular pure state | </>)), as discussed in Ref. 0]. Then 
the cost function should be based on the performance of 
the qubit, as can be empirically determined independent 
of the preparation procedure (for example, by a second 
party who will impose the cost on the preparer). Any 
such cost function depends only upon E((/)|pt|^ , ) ) the ex- 
pected fidelity of the state E[pr] with the fiducial state 
For example, for cr z \(f)) = + \4>), the fidelity, which we 
would wish to maximize, is [l+E(i?T cos 9t)]/2. Since we 
are working under the assumption that 9t can be set ar- 
bitrarily, all that we require is to maximize E(i?y). That 
is, the best motivated cost function for qubit purification 
is E s (— y/1 - S T ) rather than E S (S T )- 

We note that Jacobs' argument for the optimality of 
his control strategy with respect to his cost function relies 
upon the linearity of that cost function in S and hence 
does not apply to the cost function E s (— y/1 — St)- Nev- 
ertheless, below we prove rigorously that Jacobs' strategy 
is optimal for both cost functions, and the calculation is 
essentially the same in both cases. We do this by consid- 
ering the general cost function 



J[u,s] = E s (F(S%)), 



(3.1) 



where F can be given either by F(x) — x or by F(x) = 
x. Here the superscript u on S 1 " means that S" 
under the control strategy u. 



is propagated by Eq. 
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Before presenting the theorem we use to prove that 
Jacob's strategy is optimal, we first give a a heuristic 
derivation of the so-called Bellman equation, see e.g. fl6l ] . 
This derivation provides some insight into the problem, 
and the Bellman equation will reappear in the theorem. 
In situations (unlike the present case) where one did not 
have a candidate optimal control strategy, the Bellman 
equation may also suggest such a candidate. 

To derive the Bellman equation, consider a given 
Markov strategy u, and define the cost-to-go at time t 
given that St = s by 



J[u,s,t]=~E,(F(S%)\St = s) 



(3.2) 



so that J[u, s] = J[u, s, 0]. Now we assume that there 
exists an optimal control strategy that is, a strategy 
that minimizes the cost function. We also assume that 
it is Markov; that is, u£ = u*(t,St), where u*(t,s) is a 
deterministic function. From these assumptions, we can 
define the value function to be the optimal cost-to-go: 



V(t,s) = J[u*,s,t}. 



(3.3) 



Since u\ is a strategy that minimizes the cost over 
the full interval [0, T] it seems obvious that it will also 
minimize the cost-to-go over the interval [£, T]; that is, 



V(t, s) = J[u*, s, t] = inf J[u, s, t], 



(3.4) 



where the infimum is over all Markov control strategies. 
This is a form of the the principle of optimality and can 
indeed be made rigorous, see e.g. [16|. Since at this point 



we are merely arguing heuristically, we do not need a 
rigorous proof. The idea behind the proof is, however, 
simple to explain. If a better strategy u exists on the 
interval [i, T], we would simply use u* up to time t and 
would proceed with the better strategy u from time t 
onwards and obtain a lower cost over the interval [0, T]. 
This contradicts optimality of u* and hence such a u can 
not exist. 

The principle of optimality leads to the following re- 
cursion for the value function (optimal cost-to-go) 

V(t, s) = inf E (V(f, S$)\S? = s) , V(T, s) = F(s), 

(3.5) 

where < t < t' < T and the infimum is over the Markov 
control strategies on the interval [t, t']. To see this, note 
that E(V(t',S$)\S? = s) is the cost-to-go of a control 
strategy that follows u on the interval [t, t'} and u* on the 
interval [t',Tj. If we take t' = t + dt, expand V(t',S t >) 
according to Ito's rule, and re-arrange the terms, then we 
arrive at the following Bellman equation 



^(M) = - M£ v V(t,s) 
at ue[o,i] 



(3.6) 



BV 

= 4s — (t,s)- inf v 2 VV(t,s), (3.7) 
os ve[o,i] 

with terminal condition V(T,s) — F(s). Since u* is an 
optimal strategy, the infimum is attained at v = u*(t, s). 



Moreover, it is clear that u*(t, s) G {0,1}, so that the 
Bellman equation can be rewritten as 



dV 
dt 



4s 



dV 
ds 



min{0, VV} = 0. 



(3.8) 



Jacobs' strategy chooses Ut = for < t < T. This 
corresponds to instantaneously rotating the qubit onto 
the x-axis at t = 0, and then actively maintaining it 
there. We now show that it yields a solution to the Bell- 
man equation, as it should if it were an optimal strat- 
egy. For ut = Eq. (|2.5|) gives the deterministic solution 
S t = S e~ 4t , so that St = S'te" 4 ^"'). If this strategy 
were optimal then V(t, s), the optimal cost-to-go, should 
be constant at F(St), and hence should be given by 



V{t,s) = F(se 



-4(T-t) 



(3.9) 



For this choice it is simple to verify that for both cases 
F(x) — x and F{x) = —y/l — x, and for all s G [0, 1] and 
t G [0,T], that 



VV 
dV 
~dt 



> 0, 



4s ■ 



dV_ 

ds' 



(3.10) 
(3.11) 



Thus Eq. (|3.9p is indeed a solution of Eq. 

We emphasize that this result does not prove that Ja- 
cobs' strategy is optimal. A strategy that is optimal will 
give a solution to the Bellman equation, but a solution 
of the Bellman equation does not necessarily correspond 
to an optimal strategy. However, it is simple to check 
whether it does correspond to an optimal strategy, as the 
theorem below shows. Instead of assuming the existence 
of an optimal strategy u* , we now consider a solution V 
to the Bellman equation Eq. (|3.6p with terminal condi- 
tion V(T, s) ~ F(s) and we define a candidate optimal 
Markov strategy u* by 



u*(t,s) G argmin£'"F(i, s) 
ve[o,i] 



(3.12) 



where the right-hand-side means the value of v G [0, 1] 
that minimizes C v V(t, s). 

Theorem [TJ which is a special case of a standard result 
in control theory [l5|, Thm. 8.1], provides a set of crite- 
ria that guarantees that a particular control strategy u t is 
optimal. We apply the theorem to our candidate strategy 
with V as in Eq. (|3.9|) . Note that critcria[5]and[3]togcther 
arc equivalent to V satisfying the Bellman equation and 
u being defined as in Eq. (|3.12p . Criterion 2] is the ter- 
minal condition for the Bellman equation. Criterion [T] 
is a smoothness condition on V, which from Eq. (|3.9[) is 
readily seen to hold for the two functions F we are con- 
sidering. The Theorem therefore allows us to conclude 
that our candidate strategy is indeed optimal. 

Theorem 1 (Verification theorem) Suppose there ex- 
ists a Markov control u : [0,T] x [0, 1] — > [—1, 1] and a 
function V : [0, T] x [0, 1] — > ffi such that 
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1. V(t, s) is C 1 on [0, T) and C 2 in s. 

2. For all t £ [0, T] and s £ [0, 1] , 

dV, 



dt 



{t,s)+C u{t ' s) V{t,s) = 0. 



3. For all t £ [0, T], v £ [-1, 1] and s £ [0, 1], 
dV 



dt 



(t,s)+CV(t,s) > 0. 



4- For all s £ [0, 1], V satisfies the terminal condition 
V(T,s)=F(s). 

Then the Markov control strategy u is optimal, i.e. 
J[u, s] < J[u, s] for all s £ [0, 1] and any control Ut taking 
values in [—1,1], and moreover V{f), s) = J[u, s]. 

Proof. Let u t be any control strategy. Then using 
Dynkin's formula and the first, third and fourth condi- 
tions in the verification theorem, we obtain 



V(0,s) = Es(V(T,S T )) 

-E s / 
Jo 

< E S (V(T,S T )) = E S (F(S T )) = J[u,s}. 



^(t,S t ) + £^V(t,S t ) 



dt 



Using the second condition, we similarly conclude that 
V(0, s) = J[u, s}. Hence J[u, s] < J[u, s] for any u. □ 



IV. WISEMAN AND RALPH'S PROBLEM 

As well as the variation on Jacob's problem discussed 
above, Wiseman and Ralph [3[ also considered a quite dif- 
ferent control problem. Their problem fixes some thresh- 
old h £ (0, 1), and seeks a feedback control ut that min- 
imizes the average time at which the linear entropy first 
hits the threshold value of h. That is, the cost function 
is 

J[u, s] = E s (t u ) , t u = inf {t : < h} . (4.1) 

As before, the superscript u on reminds us that 5* 
is propagated by Eq. (|2.5[) under the control strategy u. 
It is clear from the discussion at the start of the pre- 
ceding section that, under our assumptions, this prob- 
lem is equivalent to minimizing the time taken to pre- 
pare a given pure state (in the y — plane) with fidelity 
(1 + a/T - 70/2. 

From consideration of the stochastic evolution of 
log(l — St), Wiseman and Ralph Q claimed that the 
strategy u = 1 was optimal in the asymptotic (/i<C 1) 
limit. This strategy requires instantaneously rotating (if 
necessary) the qubit onto the z-axis at t = 0, then let- 
ting fl t = for t > 0. Here we verify that this is indeed 



the optimal strategy for all h. We proceed as before, 
first finding the Bellman equation and exhibiting a solu- 
tion using the strategy proposed by Wiseman and Ralph. 
Then we turn the argument around and use the solution 
to the Bellman equation to prove rigorously that the can- 
didate strategy is indeed optimal. 

To derive the Bellman equation, we again start by as- 
suming the existence of an optimal Markov strategy u£ . 
The value function, the optimal cost-to-go, is given by 



V(M)=e((t u * -t) + 



S? =sj , t > 0, 1 > s > h. 

(4.2) 

Here (t u — t) + = max{0, (r" — t)}, the time left after 
time t until the threshold h is first hit. Note that it 
follows immediately from the definition that V(t, s) = 
for all t > and s < h. The principle of optimality 
then yields the following recursion for the value function 
(where t' A t u means mm{t' , r"}) 

V(t, s) = inf E ((*' A t u ) - t + V(t', S£ AT ») |S? = s) . 

U 

(4.3) 

Here < t < t', h < s < 1 and the infimum is over all 
Markov control strategies on the interval [£, t']. To see 
this, note that E ((£' A r") —t + V(t', S?, AtU ) \S? = s) is 
the cost-to-go of a control strategy that follows u on the 
interval \t,t'\ and it* after time t' . If t' is smaller than 
t u then we incur a cost t' — t on the interval [t, t'\ plus 
the optimal cost-to-go if we are at at time r. If t' 
is greater than t" then we incur a cost r" — t' over the 
interval [*,*']. Note that V(t',S T «) = V(t',h) = 0. 

If we take t' = t + dt, expand V according to Ito's 
rule and re- arrange the terms, then Eq. (|4.3[) leads to the 
following Bellman equation 



dV 
dt 



1 



inf C v V(t,s) 
«e[o,i] 



(4.4) 



BV 

l-4s— (t,s)+ inf v 2 VV{t,s), (4.5) 
us «e[o,i] 



with boundary condition V(t, h) = 0. As before, the 
minimand is linear in v 2 so this can be rewritten as 

dV dV 
- — = l-4s— + min{0,W}. (4.6) 
ut os 

Having obtained the Bellman equation, we now need to 
find a solution. Wiseman and Ralph Q show by explicit 
calculation that for u t = 1 the expected cost-to-go is 



J[u,s,t] = E((T u -t)+\S? 

= [ 7 (h)- 7 (s)]/4, 



(4.7) 
(4.8) 



where for convenience we have defined, for x £ [0, 1], 

7 (x) = Vl - x atanh(Vl - x) . (4.9) 

Note that J[u, s, t] is independent of t. This is to be 
expected, since the control strategy u t = 1 is not time 
dependent, so the time left until we hit the threshold h, 
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starting at s, does not depend on when s was actually 
reached. Now, if Wiseman and Ralph's proposed strat- 
egy is optimal, then from the heuristic arguments above, 
taking 



V(s) = [j(h) - 7 (a)]/4 



(4.10) 



should satisfy the Bellman equation Eq. (|4.6|) . 

It is easily verified that for all s G [h 7 l] Eq. (|4.10j) 
satisfies 

dV 

4s— (s)-W(s) = 1, (4.11) 



9s 



< 0, 



(4.12) 



as well as V(h) = 0. Thus the strategy of Wiseman and 
Ralph does indeed give a solution of the Bellman equa- 
tion Eq. (|4.6[) . Moreover, it is clear that the infimum 
in Eq. f|4. 6|) is attained at v = 1. Thus if we turn the 
problem around and define a candidate optimal control 
strategy via Eq. (|3 . 1 2f> using the solution Eq. (|4.10[) to the 
Bellman equation, this gives the Wiseman-Ralph strat- 
egy w*(£, s) = 1 for all t > and s € ]/i, 1]. Note that 
we do not need to worry about s G [0, /i) — in this case 
t = 0, which can not be improved upon! 

We emphasize again that the above argument only es- 
tablishes that the Wiseman-Ralph strategy is a candidate 
optimal strategy. To verify that it is optimal we proceed 
as before, via a second verification theorem. This is an- 
other special case of a standard result in control theory 
[l9l . Thm. 3.1]. We apply the Theorem to our candidate 
strategy with V as in Eq. (|4. 10[) . Note again that criteria 
[H and [3] together are equivalent to V satisfying the Bell- 
man equation and u being defined as in Eq. (|3 . 1 2[) . Cri- 
terion |4] is the terminal condition for the Bellman equa- 
tion. Criterion [1] is a smoothness condition on V, which 
from Eq. (|4.10p can be verified to hold. This Theorem 
therefore allows us to conclude that the Wiseman-Ralph 
strategy is indeed optimal. 

Theorem 2 (Verification theorem) Suppose there ex- 
ist a stationary Markov control u : [h,\] — > [—1, 1] such 
that E s (t w ) < 00, and a function V : [h, 1] — ► K such 
that 

1. V(s) is C 2 in s. 

2. For all s G [h, 1], C u ^V(s) + 1 = 0. 

3. For all v G [-1, 1] and s G [h, 1], C v V(s) + 1 > 0. 

4. V(h) = 0. 

Then the stationary Markov control strategy u is optimal, 
i.e. J[u,s] < J[u,s] for any control iit taking values in 
[—1, 1], and moreover V(s) = J[u,s]. 

Proof. Let u t be any control strategy. Just as in the 
proof of the previous theorem, we would like to apply 
Dynkin's formula, but with the random terminal time 
r" replacing T. This is possible only if E s (t") < 00 
However, note that if E s (r u ) is infinite, then it is 
immediate that 



That is, we can safely restrict our attention to control 
strategies u for which E s (t 11 ) is finite. Therefore we 
can now safely apply Dynkin's formula using the random 
terminal time r" [181 ] . This gives, using the conditions of 
the theorem, 



V{s) = E s 



V(S%) - f C %t V{Sf) dt 
Jo 



< E, 



V (S% 



dt 



E s (V (S^O+t") 



JK S ] = E s (r»)<E s (r' 



J[u, s] 



But note that = h by the definition of t u ; so 
V(s) < E s (V(h) + r fi ) = E s (r fi ) = J[u, s}. 

Using the remaining (second) condition, we similarly 
conclude that V(s) = J[u,s}. Hence J[u, s] < J[u,s] for 
any u. □ 



V. CONCLUSION 

We have highlighted a simple technique from optimal 
control theory to verify optimality of candidate control 
strategies. This paper was prompted by the absence of 
such arguments in the current physics literature (but see 
3). In particular, we have verified the optimality of pre- 
viously proposed control strategies for two problems of 
current interest [l], 0, Hi HO, HH, [HJ in nonlinear quantum 
feedback control. 

Optimal control theory is notoriously difficult for non- 
linear systems, and the reader might be surprised at our 
explicit computations. This simplicity is only made pos- 
sible by the rather "violent" assumption that we can di- 
rectly control cos 9 t rather than f2 t . A much more realis- 
tic control problem would be to find an optimal f2 t under 
additional finite energy constraints on the control. Such 
problems, however, are not analytically tractable. Var- 
ious strategies with finite controls were investigated nu- 
merically for a particular (solid-state) setting in Ref. [2l[ , 
with no claims of optimality. 



Acknowledgments 

We thank Ramon van Handel for motivating and guid- 
ing this work, and for many stimulating discussions. 
HMW acknowledges support by the Australian Research 
Council and the State of Queensland, and LB acknowl- 
edges support by the ARO under grant number W911NF- 
06-1-0378. 



6 



[1] K. Jacobs, Phys. Rev. A 67 030301(R) (2003). [13 
[2] K. Jacobs, Proc. SPIE 5468 355 (2004). 
[3] H. M. Wiseman and J. F. Ralph, New J. Phys. 8, 90 [14 
(2006). 

[4] C. Ahn, A. C. Doherty, and A. J. Landahl, Phys. Rev. [15 

A 65, 042301 (2002). 
[5] K. Jacobs, Quant. Inf. and Comp. 7, 127 (2006). [16 
[6] V. P. Belavkin, pp. 310-324 in Proceedings XXIV 

Karpacz winter school (R. Guelerak and W. Karwowski, [17 

eds.), World Scientific, Singapore, (1988). [18 
[7] L. Bouten, M. Guta and H. Maassen, J. Phys. A 37, 3189 

(2004). [19 

[8] L. Bouten and R. Van Handel, \math-ph/051 10M\ (2005). 
[9] H. M. Wiseman and G. J. Milburn, Phys. Rev. A 47, [20 

1652 (1993). 

[10] H. M. Wiseman, Quantum Semiclass. Opt. 8, 205 (1996). [21 
[11] A. C. Doherty, S. Habib, K. Jacobs, H. Mabuchi, S. M. 

Tan Phys. Rev. A 62, 012105 (2000). [22 
[12] H. M. Wiseman, S. Mancini, and J. Wang, Phys. Rev. A 

66, 013807 (2002). 



Hsi-Sheng Goan, G. J. Milburn, H. M. Wiseman, and He 
Bi Sun, Phys. Rev. B 63, 125326 (2001). 
R. van Handel, J. K. Stockton, and H. Mabuchi, J. Opt. 
B: Quantum Semiclass. Opt. 7, S179 (2005). 
W. H. Fleming and H. M. Soner, Controlled Markov Pro- 
cesses and Viscosity Solutions (Springer, Berlin, 2006) 
H.J. Kushner, Introduction to stochastic control (Holt, 
Rhinehart and Winston Inc., 1971) 
N. Khaneja, et al. J. Mag. Res. 162, 311 (2003). 
B. Oksendal, Stochastic Differential Equations, fifth ed. 
(Springer, Berlin, 2000) 

B. Oksendal and A. Sulem, Applied stochastic control of 
jump diffusions, (Springer, Berlin, 2005) 

J. Combes and K. Jacobs, Phys. Rev. Lett. 96, 010504 
(2006) 

E. J. Griffith, C. D. Hill, J. F. Ralph, H. M. Wiseman, 
and K. Jacobs, Phys. Rev. B 75, 014511 (2007 ) 

C. D. Hill and J. F. Rai^ph\quant-ph/0610156\ (2006). 



